Round 1: Technical (AWS Data Engineering and Spark)
β Introduction
π Tell me about yourself and any recent projects you have been a part of.
π Questions related to your projects.
β Technical Questions
π How would you fetch data from an API, and how would you build a pipeline to process it using AWS services like Lambda, API Gateway, and S3?
π How would you migrate data from an on-premises SQL database to AWS?
π How does the AWS Glue service connect to an on-premises SQL Server to extract data?
π How would you create a pipeline in AWS Data Pipeline to copy only recently modified files from S3 buckets to an output bucket in the same S3?
β Scenario-Based Questions
π Explain how you would implement partitioning and bucketing in Spark for data stored in S3 to optimize query performance.
π How would you design a data pipeline for near real-time data ingestion and processing from various sources (e.g., S3, Kinesis) to Redshift using Spark?
π Describe how you would use AWS Glue for orchestration and scheduling of Spark jobs.
π How do you handle incremental updates in a data lake architecture using AWS services and Spark?
π What is your approach to monitoring and logging Spark jobs in AWS, and how do you track performance issues?
β AWS Service-Specific Questions
πScenario-based questions related to AWS Data Pipeline, Glue, or Lambda.
Round 2: Team Fitment and Company Understanding (15 Minutes)
β Company Knowledge
π What do you know about EPAM?
β Team Fitment
π Questions to assess your compatibility with the team and work culture.
β Reason for Change
π What is your reason for seeking a job change?
Round 3: HR Interview
β Experience and Projects
π Discuss your experience and recent projects.
π Resume-specific questions related to your skills and achievements.
β Role Expectations and Fit
π What are you expecting in your next job role?
π How soon can you join the company, and what is your preferred location?